# ViT Backbone Network
Checkpoint Aerial Mast3r
AerialMegaDepth is a deep learning model for aerial-ground reconstruction and view synthesis. It reconstructs 3D scenes from aerial images and synthesizes novel viewpoints.
3D Vision
kvuong2711
15
0
Dpt Large Ade20k
MIT
A Transformer-based semantic segmentation model trained on the ADE20K dataset.
Image Segmentation
Safetensors
smp-hub
279
0
Vit Base Patch16 224.orig In21k
Apache-2.0
A Vision Transformer (ViT) image classification model pretrained on ImageNet-21k, suitable for feature extraction and fine-tuning.
Image Classification
Transformers

timm
23.07k
1
Samvit Base Patch16.sa1b
Apache-2.0
Segment Anything Vision Transformer (SAM ViT) image feature model. It provides only the image encoder, for feature extraction and fine-tuning, and does not include a segmentation head.
Image Segmentation
Transformers

timm
2,756
1